[GLUTEN-11402][VL] Fix decimal partition key serialization to preserve scale by acvictor · Pull Request #11618 · apache/gluten

acvictor · 2026-02-15T15:13:28Z

What changes are proposed in this pull request?

This PR fixes decimal partition value serialization by replacing toJavaBigInteger.toString with toJavaBigDecimal.unscaledValue().toString, removes fallback guard that was added by #11518 and adds additional test cases to SQLQuerySuite covering small decimals, zero-scale decimals, negative values, and multi-partition pruning.

How was this patch tested?

Existing UTs added in #11518 + extended Incorrect decimal casting for partition read test

Was this patch authored or co-authored using generative AI tooling?

No

Related issue: #11402

github-actions · 2026-02-15T15:13:56Z

Run Gluten Clickhouse CI on x86

github-actions · 2026-02-15T15:23:19Z

Run Gluten Clickhouse CI on x86

github-actions · 2026-02-15T16:47:08Z

Run Gluten Clickhouse CI on x86

acvictor · 2026-02-15T18:33:26Z

@baibaichen @zhouyuan can you please review? Thanks!

acvictor · 2026-02-16T06:13:38Z

cc @Surbhi-Vijay

acvictor · 2026-02-28T11:46:30Z

@zhouyuan ping on a review for this, thanks 😊

zhouyuan · 2026-03-03T10:55:10Z

@acvictor Thanks for the fix. The code looks good. However in the log, it seems there are still some fallback on scan reported, is this expected?
https://github.com/apache/incubator-gluten/actions/runs/22039361874/job/63677680172?pr=11618#step:8:8427

github-actions · 2026-03-03T11:47:47Z

Run Gluten Clickhouse CI on x86

acvictor · 2026-03-03T14:28:36Z

@acvictor Thanks for the fix. The code looks good. However in the log, it seems there are still some fallback on scan reported, is this expected? https://github.com/apache/incubator-gluten/actions/runs/22039361874/job/63677680172?pr=11618#step:8:8427

@zhouyuan this is expected.

Baseline

26/03/02 15:03:48 WARN GlutenFallbackReporter: Validation failed for plan: Exchange[QueryId=34], due to: [FallbackByBackendSettings] Validation failed on node Exchange
26/03/02 15:03:48 WARN GlutenFallbackReporter: Validation failed for plan: Project[QueryId=34], due to: 
 - Validation failed with exception from: ProjectExecTransformer, reason: CheckOverflowInTableInsert is used in ANSI mode, but Gluten does not support ANSI mode.
E20260302 15:03:48.568771 27742 Exceptions.h:53] Line: /work/ep/build-velox/build/velox_ep/velox/exec/Task.cpp:2464, Function:terminate, Expression:  Cancelled, Source: RUNTIME, ErrorCode: INVALID_STATE
26/03/02 15:03:48 WARN GlutenFallbackReporter: Validation failed for plan: Scan parquet spark_catalog.default.dynparttest2, due to: 
 - Unsupported decimal partition column in native scan.
26/03/02 15:03:48 WARN GlutenFallbackReporter: Validation failed for plan: ColumnarToRow, due to: 
 - Unsupported decimal partition column in native scan.
26/03/02 15:03:48 WARN GlutenFallbackReporter: Validation failed for plan: Scan parquet spark_catalog.default.dynparttest2[QueryId=36], due to: 
 - Unsupported decimal partition column in native scan.
26/03/02 15:03:48 WARN GlutenFallbackReporter: Validation failed for plan: ColumnarToRow[QueryId=36], due to: 
 - Unsupported decimal partition column in native scan.
- Incorrect decimal casting for partition read

This PR

26/02/15 17:07:03 WARN GlutenFallbackReporter: Validation failed for plan: Exchange[QueryId=34], due to: [FallbackByBackendSettings] Validation failed on node Exchange
26/02/15 17:07:03 WARN GlutenFallbackReporter: Validation failed for plan: Project[QueryId=34], due to: 
 - Validation failed with exception from: ProjectExecTransformer, reason: CheckOverflowInTableInsert is used in ANSI mode, but Gluten does not support ANSI mode.
E20260215 17:07:03.596613 27433 Exceptions.h:53] Line: /work/ep/build-velox/build/velox_ep/velox/exec/Task.cpp:2455, Function:terminate, Expression:  Cancelled, Source: RUNTIME, ErrorCode: INVALID_STATE
26/02/15 17:07:03 WARN GlutenFallbackReporter: Validation failed for plan: Exchange[QueryId=40], due to: [FallbackByBackendSettings] Validation failed on node Exchange
26/02/15 17:07:03 WARN GlutenFallbackReporter: Validation failed for plan: Project[QueryId=40], due to: 
 - Validation failed with exception from: ProjectExecTransformer, reason: CheckOverflowInTableInsert is used in ANSI mode, but Gluten does not support ANSI mode.
E20260215 17:07:04.033113 27433 Exceptions.h:53] Line: /work/ep/build-velox/build/velox_ep/velox/exec/Task.cpp:2455, Function:terminate, Expression:  Cancelled, Source: RUNTIME, ErrorCode: INVALID_STATE
26/02/15 17:07:04 WARN GlutenFallbackReporter: Validation failed for plan: Exchange[QueryId=46], due to: [FallbackByBackendSettings] Validation failed on node Exchange
26/02/15 17:07:04 WARN GlutenFallbackReporter: Validation failed for plan: Project[QueryId=46], due to: 
 - Validation failed with exception from: ProjectExecTransformer, reason: CheckOverflowInTableInsert is used in ANSI mode, but Gluten does not support ANSI mode.
E20260215 17:07:04.451627 27433 Exceptions.h:53] Line: /work/ep/build-velox/build/velox_ep/velox/exec/Task.cpp:2455, Function:terminate, Expression:  Cancelled, Source: RUNTIME, ErrorCode: INVALID_STATE
26/02/15 17:07:04 WARN GlutenFallbackReporter: Validation failed for plan: Exchange[QueryId=52], due to: [FallbackByBackendSettings] Validation failed on node Exchange
26/02/15 17:07:04 WARN GlutenFallbackReporter: Validation failed for plan: Project[QueryId=52], due to: 
 - Validation failed with exception from: ProjectExecTransformer, reason: CheckOverflowInTableInsert is used in ANSI mode, but Gluten does not support ANSI mode.
E20260215 17:07:04.846966 27433 Exceptions.h:53] Line: /work/ep/build-velox/build/velox_ep/velox/exec/Task.cpp:2455, Function:terminate, Expression:  Cancelled, Source: RUNTIME, ErrorCode: INVALID_STATE
26/02/15 17:07:05 WARN GlutenFallbackReporter: Validation failed for plan: Exchange[QueryId=58], due to: [FallbackByBackendSettings] Validation failed on node Exchange
26/02/15 17:07:05 WARN GlutenFallbackReporter: Validation failed for plan: Project[QueryId=58], due to: 
 - Validation failed with exception from: ProjectExecTransformer, reason: CheckOverflowInTableInsert is used in ANSI mode, but Gluten does not support ANSI mode.
E20260215 17:07:05.233858 27433 Exceptions.h:53] Line: /work/ep/build-velox/build/velox_ep/velox/exec/Task.cpp:2455, Function:terminate, Expression:  Cancelled, Source: RUNTIME, ErrorCode: INVALID_STATE
26/02/15 17:07:05 WARN GlutenFallbackReporter: Validation failed for plan: Exchange[QueryId=59], due to: [FallbackByBackendSettings] Validation failed on node Exchange
26/02/15 17:07:05 WARN GlutenFallbackReporter: Validation failed for plan: Project[QueryId=59], due to: 
 - Validation failed with exception from: ProjectExecTransformer, reason: CheckOverflowInTableInsert is used in ANSI mode, but Gluten does not support ANSI mode.
E20260215 17:07:05.426112 27433 Exceptions.h:53] Line: /work/ep/build-velox/build/velox_ep/velox/exec/Task.cpp:2455, Function:terminate, Expression:  Cancelled, Source: RUNTIME, ErrorCode: INVALID_STATE
26/02/15 17:07:05 WARN GlutenFallbackReporter: Validation failed for plan: Exchange[QueryId=61], due to: [FallbackByBackendSettings] Validation failed on node Exchange
26/02/15 17:07:05 WARN GlutenFallbackReporter: Validation failed for plan: Exchange[QueryId=62], due to: [FallbackByBackendSettings] Validation failed on node Exchange
- Incorrect decimal casting for partition read

The Exchange/Project fallbacks with CheckOverflowInTableInsert are pre-existing on the INSERT path and the baseline also has this. This PR has more instances because I extended the test to go from 1 INSERT to 6 INSERTs to cover additional decimal scenarios. The logs do show an improvement from the baseline, because Scan parquet spark_catalog.default.dynparttest2 was previously falling back with "Unsupported decimal partition column in native scan." but in this PR, that scan fallback is eliminated.

beliefer · 2026-03-04T07:35:26Z

backends-velox/src/main/scala/org/apache/gluten/backendsapi/velox/VeloxIteratorApi.scala

                DateFormatter.apply().format(pv.asInstanceOf[Integer])
              case _: DecimalType =>
-                pv.asInstanceOf[Decimal].toJavaBigInteger.toString
+                pv.asInstanceOf[Decimal].toJavaBigDecimal.unscaledValue().toString


Could you explain why decimal partition keys are not supported?

It's not that it's unsupported but rather results in incorrect casting (see #11618). The change is needed because Decimal.toJavaBigInteger truncates the fractional part, producing an incorrect unscaled value. For example, a Decimal("100.1") with scale=1 would serialize as "100" (the truncated BigInteger) instead of "1001" (the correct unscaled representation). This causes Velox reader to misinterpret decimal partition values, returning wrong query results.

beliefer

LGTM if tests passed.

github-actions · 2026-03-04T13:32:40Z

Run Gluten Clickhouse CI on x86

acvictor · 2026-03-05T17:11:37Z

@zhouyuan does this change look good to you?

zhouyuan

👍 Thanks for the fix!

github-actions bot added CORE works for Gluten Core VELOX labels Feb 15, 2026

acvictor force-pushed the acvictor/decimalPartition branch from 8d961b0 to d0172fa Compare February 15, 2026 15:22

acvictor force-pushed the acvictor/decimalPartition branch from d0172fa to f97d6e1 Compare February 15, 2026 16:46

acvictor marked this pull request as ready for review February 15, 2026 18:33

baibaichen requested review from beliefer and zhouyuan and removed request for zhouyuan March 3, 2026 11:45

baibaichen force-pushed the acvictor/decimalPartition branch from f97d6e1 to a15475e Compare March 3, 2026 11:47

beliefer reviewed Mar 4, 2026

View reviewed changes

beliefer approved these changes Mar 4, 2026

View reviewed changes

Fix decimal partition key serialization

55820f9

baibaichen force-pushed the acvictor/decimalPartition branch from a15475e to 55820f9 Compare March 4, 2026 13:32

zhouyuan approved these changes Mar 5, 2026

View reviewed changes

zhouyuan merged commit a96acea into apache:main Mar 5, 2026
61 of 62 checks passed

acvictor deleted the acvictor/decimalPartition branch March 5, 2026 18:13

Conversation

acvictor commented Feb 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes are proposed in this pull request?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

github-actions bot commented Feb 15, 2026

Uh oh!

github-actions bot commented Feb 15, 2026

Uh oh!

github-actions bot commented Feb 15, 2026

Uh oh!

acvictor commented Feb 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

acvictor commented Feb 16, 2026

Uh oh!

acvictor commented Feb 28, 2026

Uh oh!

zhouyuan commented Mar 3, 2026

Uh oh!

github-actions bot commented Mar 3, 2026

Uh oh!

acvictor commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

beliefer Mar 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

acvictor Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

beliefer left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Mar 4, 2026

Uh oh!

acvictor commented Mar 5, 2026

Uh oh!

zhouyuan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

acvictor commented Feb 15, 2026 •

edited

Loading

acvictor commented Feb 15, 2026 •

edited

Loading

acvictor commented Mar 3, 2026 •

edited

Loading

beliefer Mar 4, 2026 •

edited

Loading